Unify extract_patches to support both 2D and 3D patches #21980

MarcosAsh · 2026-01-04T19:27:10Z

Summary

Unified extract_patches to handle both 2D and 3D patch extraction based on the size argument, similar to how keras.ops.conv handles different dimensions.

Changes

size=(h, w) -> 2D patch extraction. This already existed.
size=(d, h, w) -> 3D patch extraction

Why

As suggested in #21624, having a unified API is more consistent with other Keras ops like conv.

Tests

Updated test_extract_patches_invalid_size to reflect new valid 3D sizes.
Added test_extract_patches_unified_3d for the new functionality.
All existing extract_patches tests pass.

Relates to #21624

…eras-team#21624

gemini-code-assist · 2026-01-04T19:27:28Z

Summary of Changes

Hello @MarcosAsh, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly enhances the keras.ops.image.extract_patches function by unifying its API to seamlessly handle both 2D image and 3D volume patch extraction. By intelligently interpreting the size argument, the function now provides a consistent interface, mirroring the flexibility found in other Keras operations like keras.ops.conv. This change improves the function's versatility and aligns it with Keras's goal of providing intuitive and unified APIs for various data dimensions.

Highlights

Unified API for Patch Extraction: The extract_patches function now supports both 2D image and 3D volume patch extraction through a single, unified API. The dimensionality is determined by the length of the size argument.
3D Patch Extraction Support: The function now accepts a size tuple of length 3 (e.g., (depth, height, width)) for extracting patches from 3D volumes, similar to how keras.ops.conv handles different dimensions.
Updated Documentation and Examples: The docstring for extract_patches has been updated to reflect the new 3D capabilities, including detailed explanations of input/output shapes and new examples demonstrating 3D patch extraction.
Enhanced Test Coverage: New tests (test_extract_patches_unified_3d) have been added to validate the correct behavior of 3D patch extraction for both batched and unbatched inputs. Existing tests for invalid size arguments (test_extract_patches_invalid_size) were also refined.

🧠 New Feature in Public Preview: You can now enable Memory to help Gemini Code Assist learn from your team's feedback. This makes future code reviews more consistent and personalized to your project's style. Click here to enable Memory in your admin console.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request unifies the extract_patches function to support both 2D and 3D patches, which is a great step towards a more consistent API. The implementation looks good, and the added tests cover the new functionality well.

I have a few suggestions to further improve the user experience:

The docstrings for images, size, and data_format could be more explicit to improve clarity, especially with the new 3D support.
I've also suggested adding more robust input validation at the beginning of the extract_patches function to provide clearer error messages for invalid size arguments, along with the corresponding test updates. This aligns with Keras' API design guidelines for helpful error messages.

gemini-code-assist · 2026-01-04T19:30:08Z

keras/src/ops/image.py

+        images: Input image/volume or batch of images/volumes.
+            For 2D patches: 3D `(H, W, C)` or 4D `(N, H, W, C)`.
+            For 3D patches: 4D `(D, H, W, C)` or 5D `(N, D, H, W, C)`.
+        size: Patch size as int or tuple.
+            Length 2 tuple `(patch_height, patch_width)` for 2D patches.
+            Length 3 tuple `(patch_depth, patch_height, patch_width)` for
+            3D patches.
+        strides: Strides for patch extraction. If not specified, defaults
+            to `size` (non-overlapping patches).
+        dilation_rate: Dilation rate for patch extraction. Note that
+            `dilation_rate > 1` is not supported with `strides > 1`.
        padding: The type of padding algorithm to use: `"same"` or `"valid"`.
        data_format: A string specifying the data format of the input tensor.
            It can be either `"channels_last"` or `"channels_first"`.
-            `"channels_last"` corresponds to inputs with shape
-            `(batch, height, width, channels)`, while `"channels_first"`
-            corresponds to inputs with shape `(batch, channels, height, width)`.
-            If not specified, the value will default to
-            `keras.config.image_data_format`.
+            If not specified, defaults to `keras.config.image_data_format`.


The docstrings for images, size, and data_format could be more explicit to improve clarity, especially with the new 3D support. The current images docstring assumes channels_last without stating it, the size docstring is ambiguous about integer values, and the data_format docstring is a bit sparse. I suggest clarifying these points for a better user experience.

images: Input image/volume or batch of images/volumes. Assumes `channels_last` data format. For 2D patches: 3D `(H, W, C)` or 4D `(N, H, W, C)`. For 3D patches: 4D `(D, H, W, C)` or 5D `(N, D, H, W, C)`. size: The size of the patches to extract. - If `size` is an integer, 2D patches of size `(size, size)` are extracted. - If `size` is a tuple of 2 integers, 2D patches of size `(patch_height, patch_width)` are extracted. - If `size` is a tuple of 3 integers, 3D patches of size `(patch_depth, patch_height, patch_width)` are extracted. strides: Strides for patch extraction. If not specified, defaults to `size` (non-overlapping patches). dilation_rate: Dilation rate for patch extraction. Note that `dilation_rate > 1` is not supported with `strides > 1`. padding: The type of padding algorithm to use: `"same"` or `"valid"`. data_format: A string specifying the data format of the input tensor. It can be either `"channels_last"` or `"channels_first"`. `"channels_last"` corresponds to inputs with shape `(batch, ..., channels)`, while `"channels_first"` corresponds to inputs with shape `(batch, channels, ...)`. If not specified, defaults to `keras.config.image_data_format`.

keras/src/ops/image.py

gemini-code-assist · 2026-01-04T19:30:08Z

keras/src/ops/image_test.py

    def test_extract_patches_invalid_size(self):
-        size = (3, 3, 3)  # Invalid size, too many dimensions
+        size = "5"  # Invalid size type
        image = np.random.uniform(size=(2, 20, 20, 3))
        with self.assertRaisesRegex(
            TypeError, "Expected an int or a tuple of length 2"
        ):
            kimage.extract_patches(image, size)

-        size = "5"  # Invalid size type
+        size = (3, 3, 3, 3)  # Invalid size, too many dimensions
        with self.assertRaisesRegex(
            TypeError, "Expected an int or a tuple of length 2"
        ):
            kimage.extract_patches(image, size)


This test should be updated to reflect the improved validation in extract_patches. The error types and messages for invalid size will change with the suggested validation logic.

Suggested change

def test_extract_patches_invalid_size(self):

size = (3, 3, 3) # Invalid size, too many dimensions

size = "5" # Invalid size type

image = np.random.uniform(size=(2, 20, 20, 3))

with self.assertRaisesRegex(

TypeError, "Expected an int or a tuple of length 2"

):

kimage.extract_patches(image, size)

size = "5" # Invalid size type

size = (3, 3, 3, 3) # Invalid size, too many dimensions

with self.assertRaisesRegex(

TypeError, "Expected an int or a tuple of length 2"

):

kimage.extract_patches(image, size)

def test_extract_patches_invalid_size(self):

size = "5" # Invalid size type

image = np.random.uniform(size=(2, 20, 20, 3))

with self.assertRaisesRegex(

TypeError, "Expected an int or a tuple"

):

kimage.extract_patches(image, size)

size = (3, 3, 3, 3) # Invalid size, too many dimensions

with self.assertRaisesRegex(

ValueError, "Expected a tuple of length 2 or 3"

):

kimage.extract_patches(image, size)

codecov-commenter · 2026-01-04T19:35:45Z

Codecov Report

❌ Patch coverage is 77.77778% with 2 lines in your changes missing coverage. Please review.
✅ Project coverage is 82.70%. Comparing base (43fa477) to head (e4281a1).
⚠️ Report is 27 commits behind head on master.

Files with missing lines	Patch %	Lines
keras/src/ops/image.py	77.77%	1 Missing and 1 partial ⚠️

Additional details and impacted files

@@           Coverage Diff            @@
##           master   #21980    +/-   ##
========================================
  Coverage   82.69%   82.70%            
========================================
  Files         588      588            
  Lines       61369    61570   +201     
  Branches     9612     9641    +29     
========================================
+ Hits        50749    50919   +170     
- Misses       8135     8156    +21     
- Partials     2485     2495    +10

Flag	Coverage Δ
keras	`82.52% <77.77%> (+<0.01%)`	⬆️
keras-jax	`61.48% <77.77%> (-0.10%)`	⬇️
keras-numpy	`56.75% <77.77%> (-0.08%)`	⬇️
keras-openvino	`37.45% <0.00%> (+0.09%)`	⬆️
keras-tensorflow	`63.65% <77.77%> (-0.09%)`	⬇️
keras-torch	`62.40% <77.77%> (-0.09%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

hertschuh

Thank you for the refactor!

One small comment:

hertschuh · 2026-01-09T21:40:30Z

keras/src/ops/image.py

+            For 2D patches: 3D `(H, W, C)` or 4D `(N, H, W, C)`.
+            For 3D patches: 4D `(D, H, W, C)` or 5D `(N, D, H, W, C)`.
+        size: Patch size as int or tuple.
+            Length 2 tuple `(patch_height, patch_width)` for 2D patches.


Small tweak: mention that if the size is an int it will do 2D patches:

size: Patch size as int or tuple. Length 2 tuple `(patch_height, patch_width)` or int for 2D patches.

MarcosAsh · 2026-01-11T15:13:37Z

Hey just committed the changes thanks for the help!

hertschuh

Thank you for this.

Would you be able to do this as a next step in a separate PR?

This is to fully merge both ops in a single one and have only one op class:

rename the current _extract_patches to _extract_patches_2d
create a new _extract_patches function that does

    if not isinstance(size, int) and len(size) == 3:
      return _extract_patches_3d(...)
    else:
      return _extract_patches_2d(...)

Note that this will make ExtractPatches work with 3D too.

revert extract_patches so that it only does:

    if any_symbolic_tensors((images,)):
        return ExtractPatches(
            size=size,
            strides=strides,
            dilation_rate=dilation_rate,
            padding=padding,
            data_format=data_format,
        ).symbolic_call(images)

    return _extract_patches(
        images, size, strides, dilation_rate, padding, data_format=data_format
    )

Remove the ExtractPatches3D class

That would be great. Thanks!

Unify extract_patches to support both 2D and 3D patches. Relates to k…

3a29684

…eras-team#21624

google-ml-butler bot added the size:M label Jan 4, 2026

google-ml-butler bot assigned gbaned Jan 4, 2026

gemini-code-assist bot reviewed Jan 4, 2026

View reviewed changes

Add input validation and improve error messages per review

d41c0df

hertschuh reviewed Jan 9, 2026

View reviewed changes

hertschuh added the stat:awaiting response from contributor label Jan 9, 2026

Update docstring to clarify int size for 2D patches

e4281a1

google-ml-butler bot removed the stat:awaiting response from contributor label Jan 11, 2026

hertschuh approved these changes Jan 12, 2026

View reviewed changes

google-ml-butler bot added kokoro:force-run ready to pull Ready to be merged into the codebase labels Jan 12, 2026

kokoro-team removed the kokoro:force-run label Jan 12, 2026

hertschuh merged commit 026022d into keras-team:master Jan 12, 2026
13 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Unify extract_patches to support both 2D and 3D patches #21980

Unify extract_patches to support both 2D and 3D patches #21980

MarcosAsh commented Jan 4, 2026 •

edited

Loading

Uh oh!

gemini-code-assist bot commented Jan 4, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Jan 4, 2026

Uh oh!

Uh oh!

gemini-code-assist bot Jan 4, 2026

Uh oh!

codecov-commenter commented Jan 4, 2026 •

edited

Loading

Uh oh!

hertschuh left a comment

Uh oh!

hertschuh Jan 9, 2026

Uh oh!

MarcosAsh commented Jan 11, 2026

Uh oh!

hertschuh left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Unify extract_patches to support both 2D and 3D patches #21980

Unify extract_patches to support both 2D and 3D patches #21980

Conversation

MarcosAsh commented Jan 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

Why

Tests

Uh oh!

gemini-code-assist bot commented Jan 4, 2026

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Jan 4, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

gemini-code-assist bot Jan 4, 2026

Choose a reason for hiding this comment

Uh oh!

codecov-commenter commented Jan 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

hertschuh left a comment

Choose a reason for hiding this comment

Uh oh!

hertschuh Jan 9, 2026

Choose a reason for hiding this comment

Uh oh!

MarcosAsh commented Jan 11, 2026

Uh oh!

hertschuh left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

MarcosAsh commented Jan 4, 2026 •

edited

Loading

codecov-commenter commented Jan 4, 2026 •

edited

Loading